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1 Introduction 

This document summarizes the proofs made during a Coq development in 
Summer 2015. This development investigates the function G introduced by 
Hofstadter in his famous “Godel, Escher, Bach” bookjT], as well as a related 
infinite tree. The left/right flipped variant G of this G tree has also been 
studied here, following Hofstadter’s “problem for the curious reader”. The 
initial G function is refered as sequence A005206 in OEIS[l], while the flipped 
version G is the sequence A123070 |5]. 

The detailed and machine-checked proofs can be found in the files of 
this developmenllf] and can be re-checked by running Coq [2] version 8.4 
on it. No prior knowledge of Coq is assumed here, on the contrary this 
document has rather been a “Coq-to-English” translation exercise for the 
author. Nonetheless, some proofs given in this document are still quite 
sketchy: in this case, the interested reader is encouraged to consult the Coq 
files given as references. 

2 Prior art and contributions 

Most of the results proved here were already mentioned elsewhere, e.g. on 
the OEIS pages for G and G. These results were certified in Coq without 
consulting the existing proofs in the literature, so the proofs presented here 
might still be improved. To the best of my knowledge, the main novelties of 
this development are: 

1. A proof of the recursive equation for G which is currently mentioned 
as a conjecture in OEIS: 

Vn > 3, G{n) = n + 1 — G{1 + G{n — 1)) 

^See http://www.pps.univ-paris-diderot.fr/~letouzey/hofstadter_g 


1 



2 . The statement and proof of another equation for G: 

Vn > 3, G{G{n)) + G{n — 1) = n 

Although simpler than the previous one, this equation isn’t enough 
to characterize G unless an extra monotonocity condition on G is as¬ 
sumed. 

3. The statement and proof of a result comparing G and G: for all n, G{n) 
is either G{n) -|- 1 or G{n), depending of the shape of the Zeckendorf 
decomposition of n as a sum of Fibonacci numbers. More precisely, 
when this decomposition of n starts with T 3 and has a second smallest 
term with k odd, then G{n) = G{n) -|- 1, otherwise G{n) = G{n). 
Moreover these specific numbers where G and G differ are separated 
by either 5 or 8 . 

4. The studies of G and G “derivatives” (AG(n) = G{n -|- 1) — G{n) and 
similarly for AG), leading to yet another characterization of G and G. 


3 Coq jargon 

For readers without prior knowledge of Coq but curious enough to have a 
look at the actual files of this development, here comes a few words about 
the Coq syntax. 

• By default, we’re manipulating natural numbers only, corresponding 
to Coq type nat. Only exception: in file Phi.v, we switch to real 
numbers to represent the golden ratio. 

• The symbol S denotes the successor of natural numbers. Hence S(2) 
is the same as 3. 

• As in many modern functional languages such as OCaml or Haskell, the 
usage is to skip parenthesis whenever possible, and in particular around 
atomic arguments of functions. Hence S(x) will rather be written S x. 

• The symbol pred is the predecessor for natural numbers. In Coq, all 
functions must be total, and pred 0 has been chosen to be equal to 0 . 
Similarly, the subtraction on natural numbers is rounded: 3-5 = 0. 

• Coq allows to define custom predicates to express various properties 
of numbers, lists, etc. These custom predicates are introduced by the 
keyword Inductive, followed by some “introduction” rules for the new 
predicate. In this development we use capitalized names for these 
predicates (e.g. Delta, Low, ...). 
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• Coq also accepts new definition of recursive functions via the com¬ 
mand Fixpoint. But these definitions should satisfy some criterion 
to guarantee that the new function is well-defined and total. So in 
this development, we also need sometimes to define new functions via 
explicit justification of termination, see for instance norm in Fib.v or 
g_spec in FunG.v, 


4 Fibonacci numbers and decompositions 

This section corresponds to file Fib.v, 

Convention. We use her^ the following definition of the Fibonacci num¬ 
bers Fn : 

Fo = 0 
Fi = l 

'^TL, Fn-]-2 — Fn + 

This definition is standard, see for instance OEIS’s sequence A000045 [3]. 
In Coq, these F^ numbers correspond to the fib function, and we start by 
proving a few basic properties : strict positivity except for Tq, monotony, 
strict monotony above 2, etc. We also prove the theorem fib_inv which 
states that any positive number can by bounded by consecutive Fibonacci 
numbers : 

Theorem 1 (Fibonacci inverse). Vn,3A:,F); <n < Ffc+i- 

4.1 Fibonacci decompositions 

The rest of the file Fib.v deals with the decompositions of numbers as sums 
of Fibonacci numbers. 

Definition 1. A decomposition n = TiFi is said to be canonical if: 

(a) Fq and Fi do not appear in the decomposition 

(h) A Fibonacci number appears at most once in the decomposition 

(c) Fibonacci numbers in the decomposition aren’t consecutive 

A decomposition is said to be relaxed if at least conditions (a) and (b) hold. 

^ Please note that earlier versions of this Coq development and of this document were 
using a non-standard definition of the Fibonacci sequence, where the leading 0 was omitted. 
This non-standard definition Fq = Fi = 1, Fi = 2,... was meant to ease some Coq proofs, 
but was actually providing little gain, while being quite confusing for external readers. 
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For instance, 11 admits the canonical decomposition F 4 Fq = 3 + 8 , 
but also the relaxed decomposition F 2 + -^ 3+-^4 + -^5 = 1 + 2 + 3 + 5. On 
a technical level, we represented in Coq the decompositions as sorted lists 
of ranks, and we used a predicate Delta p I to express that any element in 
the list I exceeds the previous element by at least p. We hence use p = 2 for 
canonical decompositions, and p = 1 for relaxed ones. See file DeltaList.v 
for the definition and properties of this predicate. 

Then we proved Zeckendorf’s theorem (actually discovered earlier by 
Lekkerkerker): 

Theorem 2 (Zeckendorf). Any natural number has a unique canonical de¬ 
composition. 

Proof. The proof of this theorem is quite standard : for n = 0 we take the 
empty decomposition (and not the useless Fq), while for n > 0, we take 
first the highest k such that is less or equal to n (cf f ib_inv below), and 
continue recursively with n — F^. We cannot obtain this way two consecutive 
Ffc_|_i and F^, otherwise Ffc _|_2 could have been used in the first place. This 
process will never use F’l, since F 2 has the same value for a higher rank, and 
will hence be preferred. For proving the uniqueness, the key ingredient is 
that this kind of sum cannot exceed the next Fibonacci number. For instance 
F 2 T F'i T Fq = 1 + 3 + 8 = 12 = Fj — 1. CH 

Definition 2. For a non-empty decomposition, its lowest rank is the rank of 
the lowest Fibonacci number in this decomposition. By extension, the lowest 
rank low{n) of a number n ^ 0 is the lowest rank of its (unique) canonical 
decomposition. 

For instance low{ll) = 4. In Coq, low{n) = k is written via the predicate 
Low 2 n k. Note that this notion is in fact more general : Low Ink says 
that n can be decomposed via a relaxed decomposition of lowest rank k. 

Interestingly, a relaxed decomposition can be transformed into a canon¬ 
ical one (see Coq function norm): 

Theorem 3 (normalization). Consider a relaxed decomposition of n, made 
of k terms and whose lowest rank is r. We can build a canonical decompo¬ 
sition of n, made of no more than k terms, and whose lowest rank can be 
written r + 2p with p > 0 . 

Proof. We simply eliminate the highest consecutive Fibonacci numbers (if 
any) by summing them : F^ + Fm+i —t Fm+ 2 ) and repeat this process. By 
dealing with highest numbers first, we avoid the apparition of duplicated 
terms in the decomposition: Fm +2 wasn’t already in the decomposition, 
otherwise we would have considered Fm+i + Fra +2 instead. Hence all the 
obtained decompositions during the process are correct relaxed decomposi¬ 
tions. The number of terms is decreasing by 1 at each step, so the process is 
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guaranteed to terminate, and will necessarily stops on a decomposition with 
no consecutive Fibonacci numbers : we obtain indeed the canonical decom¬ 
position of n. And finally, it is easy to see that the lowest rank is either left 
intact or raised by 2 at each step of the process. □ 

4.2 Classifications of Fibonacci decompositions. 

The end of Fib.v deals with classifications of numbers according to the 
lowest rank of their canonical decomposition. In particular, this lowest rank 
could be 2, 3 or more. It will also be interesting to distinguish between 
lowest ranks that are even or odd. These kind of classifications and their 
properties will be heavily used during theorem 1261 which compares G and G. 
For instance: 

Theorem 4 (decomposition of successor). 

1 . low{n) = 2 implies low{n -|- 1) is odd. 

2. low{n) = 3 implies low{n -|- 1) is even and different from 2. 

3. low{n) > 3 implies low{n -|- 1) = 2. 

Proof. Let r be low{n). We can write n = Fj. + T,Fi in a canonical way. 

1. If r = 2, then n -|- 1 = F 3 -|- SF) is a relaxed decomposition, and we 
conclude thanks to the previous normalization theorem. 

2. If r = 3, then n -|- 1 = F 4 + 'FFi is a relaxed decomposition, and we 
conclude similarly. 

3. If r > 3, then n + 1 = F 2 + F^ + T,Fi is directly a canonical decompo¬ 
sition. 

□ 

We also studied the decomposition of the predecessor n — 1 in function 
of n. This decomposition depends of the parity of the lowest rank in n, since 
for r 0 we have: 


F2r — 1 = F 3 -I- ... -I- F2r-l 

F 2 r +1 — 1 = F 2 + F 4 + ... + F 2 r 

Hence: 

Theorem 5 (decomposition of predecessor). 

1. If low{n) is odd then low{n — 1) = 2. 

2. If low{n) is even and different from 2, then low{n — 1) = 3. 
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3. For n> 1, iflow{n) = 2 then low(n — 1 ) > 3 . 


Proof. 

1. Let the canonical decomposition of n be L 2 r+i + SFj. Then we have 
n — 1 = ^2 + ... + F 2 r + STj, which is also a canonical decomposition. 
Moreover r ^ 0 since Fi isn’t allowed in decompositions. Hence the 
decomposition of n— 1 contains at least the term F 2 , and low{n—l) = 2 . 

2. When low{n) = 2r with r 7 ^ 0, we decompose similarly F 2 r — 1, leading 
to a lowest rank 3 for n — 1. 

3. Finally, when n = F 2 + T^Fi, then the canonical decomposition of n — 1 
is directly the rest of the decomposition of n, which isn’t empty (thanks 
to the condition n > 1 ) and cannot starts by F 2 nor F 3 (for canonicity 
reasons). 


□ 


4.3 Subdivision of decompositions starting by F 3 

When low{n) = 3 and n > 2, the canonical decomposition of n admits at 
least a second-lowest term: n = F^ + F^ + ... and we will sometimes need to 
consider the parity of this second-lowest rank. 

Definition 3. 

• A number n is said 3-even when its canonical decomposition is of the 
form F 3 + F 2 p + .... 

• A number n is said 3-odd when its canonical decomposition is of the 
form F 3 + F 2 P +1 -h ... 

In Coq, this corresponds to the ThreeEven and ThreeOdd predicates. In 
fact, the 3-odd numbers will precisely be the locations where G and G differs 
(see theorem I26p . The first of these 3-odd numbers is 7 = T 3 -|- T 5 , followed 
by 15 = T 3 -|- Fj and 20 = T 3 -|- F 5 -|- Fj. 

A few properties of 3-even and 3-odd numbers: 

Theorem 6 . 

1. A number n is 3-odd if and only if it admits a relaxed decomposition 
of the form F 3 + F 2 p+i -h .... 

2. A number n is 3-odd if and only if low(n) = 3 and low{n — 2) is odd. 

3. A number n is 3-even if and only iflow{n) = 3 and low{n — 2) is even. 

4 . When low{n) is even and at least 6, then n — 1 is 3-odd. 
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5. Two consecutive 3-odd numbers are always apart by 5 or 8. 

Proof. 

1. The left-to-right implication is obvious, since a canonical decompo¬ 
sition can in particular be seen as a relaxed one. Suppose now the 
existence of such a relaxed decomposition. During its normalization, 
the T 3 will necessarily be left intact, and the term ^ 2^+1 can grow, but 
only by steps of 2 . 

2 . Once again, the left-to-right implication is obvious. Conversely, we 
start with the canonical decomposition of n — 2 = ^ 2^+1 -|- STj. If 
p > 1, this leads to the desired canonical decomposition of n as F 2 -\- 
F 2 P +1 + TiFi. And the case p < 1 is impossible: p = 0 leads to Fi being 
part of a relaxed decomposition, and if we assume p = 1 , then we could 
turn the decomposition of re — 2 into a canonical decomposition of re — 4 
whose lowest rank is at least 5. This gives us a relaxed decomposition 
of re of the form T 2 -|- T 4 -|- STj. After normalization, we would obtain 
that low{n) is even, which is contradictory with low{n) = 3. 

3. We proceed similarly for low{n) = 3 and low{n — 2) even implies that re 
is 3-even. First we have a canonical decomposition of re —2 = F 2 p + 'FFi. 
Then p = 0 is illegal, and p = 1 would give a relaxed decomposition of 
re starting by 3, hence low{n) even, impossible. And p > 1 allows to 
write re = F 3 -|- F 2 p T,Fi in a canonical way. 

4. When re = F 2 p-\-'FFi withp > 3, then re —1 = F^-^-F^-^-...-\-F 2 p-i-\-'FFi 
and the condition on p ensures that the terms F 3 and T 5 are indeed 
present. 

5. If re is 3-odd, it could be written T 3 -|- ^ 2^+1 -|- HTj. When p > 2, then 
re -|- 5 = T 3 -|- F 5 -|- F 2 P +1 T,Fi is also 3-odd. When p = 2, then re -|- 8 
is also 3-odd. For justifying that, we need to consider the third term 
in the canonical decomposition (if any). 

• Either re is of the form F^ + F^ + F^ + ... Then re-|-F 6 = F 3 -|- 
-^5 + -^8 + ••• is a relaxed decomposition of re -|- 8 , hence re -|- 8 is 
3-odd (see the first part of the current theorem above). 

• Either re is of the form F 3 -|- F 5 -|- ST) where all terms in SF) are 
strictly greater than F 7 . Then re -|- Eg = E 3 -|- E 7 -|- SE, is a relaxed 
decomposition of re -|- 8 , hence re -|- 8 is 3-odd. 

We finish the proof by considering all intermediate numbers between 
re and re -|- 8 and we show that re -|- 5 is the only one of them that might 
be 3-odd: 

• low{n 1 ) is even (cf. theorem |4]). 


7 


• n + 2 = -F 2 + -P 4 + F 2 P +1 + and normalizing this relaxed 

decomposition shows that low{n + 2) = 2. 

• n+3 = F^-\-F/i+F 2 p+i + TiFi, and normalizing this relaxed decom¬ 
position will either combine -F 4 with some higher terms (leading to 
an odd second-lowest term and hence n -|- 3 is 3-even) or combine 
F 4 with F^ (in which case low{n -|- 3) >5). 

• n + A = F 2 + F 3 + F 4 + F' 2 p+i + TiFi, hence low{n + 4) is even. 

• n + 6 = Fq+ i^ 2 p+i + '^Fi is a relaxed decomposition whose lowest 
rank is either 6 (when p > 2) or 5 (when p = 2). After normal¬ 
ization, we obtain low{n -|- 6) > 5. 

• Hence low{n -|- 7) = 2 by last case of theorem 01 


□ 


5 The G function 

This section corresponds to file FunG.v, 

5.1 Definition and initial study of G 

Theorem 7. There exists a unique function G : N —N which satisfies the 
following equations: 

G(0) = 0 

Vn > 0, G{n) = n — G{G{n — 1)) 

Proof. The difficulty is the second recursive call of G on G(n — 1). A priori 
G(n — 1) could be arbitrary, so how could we refer to it during a recursive 
definition of G ? Fortunately G{n — 1) will always be strictly lower than n, 
and that will allow this recursive call. More formally, we prove by induction 
on n the following statement: for all n, there exists a sequence Go.-.G^ of 
numbers in [0..n] such that Gq = 0 and V/c G [l..n],Gfc = k — Gk' where 
k' = Gk-i. 

• For n = 0, an adequate sequence is obviously Gq = 0. 

• For some n, suppose we have already proved the existence of an ad¬ 
equate sequence Gq-^.G^- In particular G„ G [0..n], hence Gg„ is 
well-defined and is also in [0..n]. Finally (n-|-l) —Gg„ is in [l..(n-|-1)]. 
We define G„+i to be equal to this (n -|- 1) — Gg„ and obviously keep 
the previous values of the sequence. All these values are indeed in 
[ 0 ..(n -|- 1 )], and the recursive equations are satisfied up to A: = n -|- 1. 


All these finite sequences that extend each other lead to an inhnite sequence 
(G*n)neN of natural numbers, which can also be seen as a function G : N —)■ N, 
that satisfy the desired equations by construction. 

For the uniqueness, we should first prove that any function / satisfying 
/(O) = 0 and our recursive equation above is such that Vn, 0 < f{n) < n. 
This proof can be done by strong induction over n, and a bit of upper and 
lower bound manipulation. Then we could prove (still via strong induction 
over n) that Vn, /(n) = g{n) when / and g are any functions satisfying our 
equations. □ 

The initial values are G(0) = 0,G(1) = G{2) = 1,G(3) = 2,G(4) = 
G(5) = 3. We can then establish some basic properties of G: 

Theorem 8. 

1. Vn, 0 < G(n) < n. 

2. Vn / 0, G{n) = G{n — 1) implies G{n + 1) = G{n) + 1. 

3. Vn, G{n + 1) — G{n) G {0,1}. 

4- Vn, m,n < m implies 0 < G{m) — G{n) < m — n. 

5. Vn, G{n) = 0 if and only if n = 0. 

6 . Vn > 1, G{n) < n. 

Proof. 

1 . Already seen during the definition of G. 

2 . G(n + 1)-G(n) = (n + l)-G(G(n)-n + G(G(n-l)) = l-(G(G(n))- 
G{G{n — 1))). If G{n) = G(n — 1) then the previous expression is equal 
to 1. 

3. By strong induction over n. First, G(l) — G(0) = 1 — 0. Then for 
a given n / 0, we assume V/c < n,G{k + 1) — G{k) G {0,1}. We 
reuse the same formulation of G(n +1) — G{n) as before. By induction 
hypothesis for A: = n — 1, G{n) — G{n — 1) G {0,1}. If it’s 0, then 
G(n +1) — G{n) = 1 as before. If it’s 1, we could use another induction 
hypothesis for k = G{n — 1) (and hence A: + 1 = G{n)), leading to 
G{G{n)) — G{G{n — 1)) G {0,1} and the desired result. 

4. Mere iteration of the previous result between n and m. 

5. For all n > 1, 0 < G{n) — G{1) and G(l) = 1. 

6 . For all n > 2, G{n) — G{2) < n — 2 and G{2) = 1. 

□ 
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We can also say that lim^ooG = +oo, since G is monotone and grows 
by at least 1 every 2 steps (any stagnation is followed by a growth). Taking 
into account this limit, and G'(O) = 0, and the growth by steps of 1, we can 
also deduce that G is onto: any natural number has at least one antecedent 
by G. Moreover there cannot exists more than two antecedents for a given 
value: by monotonicity, these antecedents are neighbors, and having more 
than two would contradict the “stagnation followed by growth” rule. We can 
even provide explicitly one of these antecedent: 

Theorem 9. Vn, G{n + G{n)) = n. 

Proof. We’ve just proved that n has at least one antecedent, and no more 
than two. Let k be the largest of these antecedents. Hence G{k) = n 
and G{k + 1) 7 ^ n, leading to G{k + 1) = G{k) + 1. If we re-inject this 
into the defining equation G{k -|- 1) = k + 1 — G{G{k)), we obtain that 
G{k) -|- G{G{k)) = k hence n + G{n) = k, and finally G{n + G{n)) = G{k) = 
n. □ 

As shown during the previous proof, n -|- G{n) is actually the largest 
antecedent of n by G. In particular G{n + G{n) -|- 1) = n -|- 1. And if n has 
another antecedent, it will hence be n -|- G{n) — 1. 

From this, we can deduce a first relationship between G and Fibonacci 
numbers. 

Theorem 10. For all k >2, G{Fk) = Fk-i- 

Proof. By induction over k. First, G{F 2 ) = G(l) = 1 = Fi. Take now a 
k > 2 and assume that G{F]fj = F)._i. Then G(F)._|_i) = G{Fk -|- Fk_i) = 
G{Fk + G{Fk)) = Fk. □ 

Moreover, for k > 2, Fk = Fk-i -|- G{Fk-i) is the largest antecedent of 
Ffc-i, hence G{1 -|- Fk) = 1 -|- Fk-i. 

We could also establish a alternative equation for G, which will be used 
during the study of function G: 

Theorem 11. For all n we have G{n) + G{G{n -|- 1) — 1) = n. 

Proof. First, this equation holds when n = 0 since G(0) -|- G(G(1) — 1) = 
G(0) -|- G(1 — 1) = 0. We consider now a number n 7 ^ 0. Either G{n -|- 1) = 
G{n) or G{n + 1) = G{n) + 1. 

• In the first case, G{n— 1) cannot be equal to G(n) as well, otherwise G 
would stay flat longer than possible. Hence G{n — 1) = G{n) — 1. Hence 
G{n)+G{G{n+l)-l) = G{n)+G{G{n)-l) = G{n)+G{G{n-l)) = n. 

• In the second case: G(n)-|-G(G(n-|-l) —1) = G(n-|- 1 ) —l-|-G(G(n)) = 
(n -|- 1 ) — 1 = n. 

□ 
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5.2 The associated G tree 

Hofstadter proposed to associate the G function with an infinite tree satis¬ 
fying the following properties: 

• The nodes are labeled by all the positive numbers, in the order of a 
left-to-right breadth-first traversal, starting at 1 for the root of the 
tree. 

• For n 7 ^ 0, the node n is a child of the node g{n). 

In practice, the tree can be constructed progressively, node after node. 
For instance, the node 2 is the only child of 1, and 3 is the only child of 2, 
while 4 and 5 are the children of 3. Now come the nodes 6 and 7 on top of 
4, and so on. The picture below represents the tree up to depth 7 (assuming 
that the root is at depth 0). 
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This construction process will always be successful: when adding the new 
node n, this new node n will be linked to a parent node already constructed, 
since G{n) < n as soon as n > 1. Moreover, G is monotone and grows by at 
most 1 at each step, hence the position of the new node n will be compatible 
with the left-to-right breadth-first ordering: n has either the same parent as 
n — 1, and we place n to the right of n — 1, or G{n) = G{n — 1) -|- 1 in which 
case n is either to be placed on the right of n — 1, or at a greater depth. 

Moreover, since G is onto, each node will have at least one child, and 
we have already seen that each number has at most two antecedents by G, 
hence the node arities are 1 or 2. 
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Tree depth 

On the previous picture, we can notice that the rightmost nodes are Fi¬ 
bonacci numbers, while the leftmost nodes have the form 1 -|- Before 
proving this fact, let us first define more properly a depth function. 

Definition 4. For a number n > 0, we note depth{n) the least number d 
such that G'^{n) (the d-th iteration of G on n) reaches 1. We complete this 
definition by choosing arbitrarily depth{ 0 ) = 0. 

For n > 0, such a number d is guaranteed to exists since G{k) < k as 
long as k is still not 1, hence the sequence G^{n) is strictly decreasing as 
long as it hasn’t reached 1. In particular, we have depth{l) = 0, and for all 
n > 1 we have depth{n) = 1 -|- depth{G{n)). Hence our depth function is 
compatible with the usual notion of depth in a tree. 

Theorem 12. 

1. For all n, depth{n) = 0 if and only if n < 1. 

2. For n > 1 and k < depth{n), we have G^{n) > 1. 

3. For n,m > 0, n < m implies depth{n) < depth{m). 

4- For k > 1, depth{Fjf) = k — 2. 

5. For k > 1, depth{l + Fk) = k — 1. 

6 . For k > 0,n > 0, depth{n) = k if and only if I + F^+i < n < Fk+ 2 - 
Proof. 

1. The statement is valid when n = 0. For a n > 0, depth{n) = 0 means 
G^{n) = 1 i.e. n = 1. 

2. Consequence of the minimality of depth. 

3. Consequence of the monotony of G : for all k, G^{n) < G^{m) hence 
the iterates of n will reach 1 faster than the iterates of m. 

4. G maps a Fibonacci number to the previous one, k — 2 iterations of G 
on Ffc leads to F 2 = 1. 

5. G maps a successor of a Fibonacci number to the previous such number, 
k — 2 iterations of G on 1 -|- leads to 1 -|- F 2 = 2. 

6. If 1 -|- Ffc_|_i < n < Ffc_|_ 2 , the monotony of depth and the previous 
facts gives k < depth{n) < /c, hence depth{n) = k. Conversely, when 
depth{n) = k, we cannot have n < 1 -|- Fk+i otherwise n < Fk+i 
and hence depth{n) < /c — 1, and we cannot have n > Fk +2 otherwise 
n > 1 -|- Fk +2 and hence depth{n) > k + 1. 
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□ 


The previous characterization of depth{n) via Fibonacci bounds shows 
that depth could also have been defined thanks to f ib_inv(n—1) —1, see[T] It 
also shows that the number of nodes at depth k is Ffc _|_2 — (1 + Fk+i) +1 = 
as soon as A; 7 ^ 0. 


The shape of the G tree 

If we ignore the node labels and concentrate on the shape of the G tree, 
we encounter a great regularity. G starts with two unary nodes that are 
particular cases (labeled 1 and 2 ), and after that we encounter a sub-tree G' 
whose shape is obtained by repetitions of the same basic pattern: 


G’ 


G’ 


G’ 



G = • and G’ = * 

G’ has hence a fractal shape: it appears as a sub-tree of itself. To prove 
the existence of such a pattern, we study the arity of children nodes. 


Theorem 13. In the G tree, a binary node has a left child which is also 
binary, and a right child which is unary, while the unique child of a unary 
node is itself binary. 


Proof. First, we show that the leftmost child of a node is always binary. Let 
n be a node, and p its leftmost child, i.e. G{p) = n and G{p — 1 ) = n — 1 . 
We already know one child q = p + G{p) of p, let’s now show that q — 1 
is also a child of p. For that, we consider p — 1 and its rightmost child 
q' = p — 1 + G{p — 1). Rightmost means that G{q' + 1) = p. So G{q — 1) = 
G{p + G{p) — 1) = G{p + n — l) = G{p + G{p — 1)) = G{q' + 1) = p and we 
can conclude. 

We can now affirm that the unique child of a unary node is itself binary, 
since this child is in particular the leftmost one. 

Let’s now consider a binary node n, with its right child p = n + G{n) 
and its left child p — 1 : G{p) = G{p — 1) = n. 
q -2 q-l q 



p — 1 p 



n 

The leftmost child p — 1 is already known to be binary. Let’s now show 
that the right child p is unary. If p has a second child, it will be g — 1, but: 


13 


G{q — 1) = G{p + G{p) — 1) = G{p — 1 + G{p — 1)) = p — 1. so g — 1 is a 
child of p — 1 rather than p, and p is indeed a unary node. □ 

5.3 G and Fibonacci decompositions 

Theorem 14. Let n = TiFi he a relaxed Fibonacci decomposition. Then 
G{n) = TiFi-i, the Fibonacci decomposition obtained by removing 1 at all 
the ranks in the initial decomposition. 

For instance G(ll) = G(F 4 + Fq) = F^ + F^ = 7. Note that the obtained 
decomposition isn’t necessarily relaxed anymore, since a Fi might have ap¬ 
peared if F ‘2 was part of the initial decomposition. The theorem also holds for 
canonical decompositions since they are a particular case of relaxed decom¬ 
positions. Once again, the obtained decompositions might not be canonical 
since Fi might appear, but in this case we could turn this Fi into a F 2 . This 
gives us a relaxed decomposition that we can re-normalize. For instance 
G(12) = G{F 2 + F 4 + Fq) = Fi + F^F F^ = F 2 + F^F F^ = F^F F^ = Fq = 8 . 

Proof. We proceed by strong induction over n. The case 0 is obvious, since 
the only possible decomposition is the empty one, and the “shifted” decom¬ 
position is still empty, as required by G'(O) = 0. We now consider a number 
n 7 ^ 0, with a non-empty relaxed decomposition n = F/^ F 'PFi, and assume 
the statement to be true for all m < n. We will use the recursive equation 
G{n) = n — G{G{n — 1)), and distinguish many cases according to the value 
of k. 

• Case k = 2. Then n — 1 = SFj. A first induction hypothesis gives 

G{n — 1) = TiFi-i. Since the initial decomposition was relaxed, SFj 
cannot contain F 2 , hence SFj_i is still a relaxed decomposition. As 
seen many times now, G{n — 1) < n here, and we’re free to use a 
second induction hypothesis leading to G{G{n — 1)) = SFj_ 2 . Finally 
G{n) = 1-1- SFj - IlFj _2 = 1-1- i;(Fi - Fj_ 2 ) = FiF as required. 

• Case k even and distinct from 2. Then k can be written 2p with p > 1. 
Then n —1 = ^ 2 ^ —l-|-SFj = F'^FF^F■.■FF 2 p-iFT‘Fi. The induction 
hypothesis on n — 1 gives: 

G{n — 1) = F 2 F F/^ F ... F F 2 p -2 F SFj_i 

Since this is still a relaxed decomposition, we use a second induction 
hypothesis on it, hence: 

G{G{n — 1)) = Fi -|- F 3 -|-... -|- F 2 p-z F 5 ^Fj _2 

G{G{n - 1)) = 1 + {F 2 P -2 - 1) + SF ,_2 

And so: 


G{n) — F2p — F2P-2 F T‘{Fi — Fj_2) — F2p-i F SFj_i 
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• Case k odd. Then k can be written 2p + 1 with j? > 0. Then n — 1 = 
F 2 p+i — 1 + TiFi = T 2 + T 4 + ... + F 2 p + STj. The induction hypothesis 
on n — 1 gives: 

G{n — 1 ) = Ti + T3 + ... + F2p-i + 

We turn the Fi above into a F 2 , obtaining again a relaxed decomposi¬ 
tion, for which a second induction hypothesis gives: 

G{G{n — 1)) = Fi + F 2 -|- ... + F 2 p -2 + 

G{G{n - 1)) = 1 + {F 2 P -1 - 1) + SF ,_2 

And so: 

G{n) = F 2 P +1 — F 2 P -1 + T,{Fi — Fi- 2 ) = F 2 p + 


□ 

Thanks to this characterization of the effect of G on Fibonacci decom¬ 
position, we can now derive a few interesting properties. 

Theorem 15. 

1. G{n -|- 1) = G{n) if and only if low{n) = 2 . 

2. If low{n) is odd, then G{n — 1) = G{n). 

3. If low{n) is even, then G{n — 1) = G{n) — 1. 

4- For n 7 ^ 0, the node n of tree G is unary if and only if low{n) is odd. 
Proof. 

1. If low{n) = 2, then we can write a canonical decomposition re = F 2 -|- 

STj, hence re-|-l = F^+TiFi is a relaxed decomposition. By the previous 
theorem, G{n) = Fi + SFj_i while G{n -|- 1) = T 2 -|- SFi_i, hence the 
desired equality. Conversely, we consider a canonical decomposition of 
re as TiFi. If F 2 isn’t part of this decomposition, then re -|- 1 = T 2 -|- STj 
is a correct relaxed decomposition, leading to G{n -|- 1) = 1 -b = 

1 -b G{n). So low{n) / 2 implies G{n -b 1) / G{n). 

2. If low{n) is odd, we’ve already shown in theorem [5] that low{n — 1) = 2 
hence G{n) = G(re — 1 ) by the previous point. 

3. If low{n) is even, the same theorem [5] shows that low{n — 1) is 3 or 
more, provided that re > 1. In this case the first point above allows 
to conclude. We now check separately the cases re = 0 (irrelevant here 
since low{0) doesn’t exists) and re = 1 (for which G{1 — 1) is indeed 
G(l)-l). 
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4. We’ve already seen that a node n is binary whenever G{n + G{n) — 1) = 
n. When low{n) is odd, we have G{n — 1) = G{n), hence G{n + G{n) — 
1) = G{n — 1 + G{n — 1)) = n — 1, hence n is unary. When low{n) is 
even, we have n + G{n) — 1 = (n — 1 + G{n — 1)) + 1 : this is the next 
node to the right after the rightmost child of n — 1, its image by G is 
hence n, and finally n is indeed binary. 

□ 

Theorem 16. If low(n) > 2, then low{G{n)) = low{n) — 1. In particular, 
if low(n) is odd, then low{G{n)) is even, and iflow{n) is even and different 
from 2, then low{G{n)) is odd. 

Proof. This is a direct application of theorem [T31 when low{n) > 2, the 
decomposition we obtain for G{n) is still canonical, and its lowest rank is 
low{n) — 1. The statements about parity are immediate consequences. □ 

Theorem 17. 

1. If low{n) = 2 , then low{G{n)) is even. 

2. If low(n) = 3, then low{G{n)) = 2. 

3. If low(n) > 3, then low{G{n)) > 2 and low{G{n) + 1) is even. 

Proof. 

1. Once again, we consider a canonical decomposition n = + Then 

G{n) = Fi + STj_i. If we turn the Fi into a F 2 , we obtain a relaxed 
decomposition, that can be normalized into a canonical decomposition 
whose lowest rank will hence be even. 

2. Direct application of previous theorem: low{G{n)) = low{n) — 1. 

3. Here also, low{G{n)) = low{n) — 1, hence low{G{n)) cannot be 2 here. 
Then the theorem 0] implies that low{G{n) + 1) is even. 


□ 


Theorem 18. 

1. If n is 3-even, then G{n) + 1 is 3-odd. 

2. If n is 3-odd, then either G{n) + 1 is 3-even or low{G{n) + 1) > 3. 
Proof. 

1. Take a canonical decomposition n = F^-IF 2 p-\-TiFi. Hence G(n) + 1 = 
1 + F 2 + F 2 p-i + = F 3 + F 2 p-i + STi_i, and this decomposition 

is still canonical (for canonicity reasons, p > 2). 
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2 . Similarly, n = F 3 + F 2 p+i + SFj implies G{n) + 1 = F 3 + F 2 p + 

When p > 1, this decomposition is canonical and G{n) + 1 is 3-even. 
When p = 1, this decomposition is only a relaxed one, starting by 
.^3 + .^4 + • • • • Its normalization will hence end with a lowest rank of at 
least 5. 

□ 

5.4 G and its “derivative” AG 

We consider now the “derivative” AG of G, defined via AG{n) = G{n + 1) — 
G{n). We already know from theorem [8] that the output of AG is always 
either 0 or 1. 

Theorem 19. For all n, AG(n -|- 1) = 1 — AG(n).AG(G(n)). 

Proof. We already know that AG(n) = 0 implies AG(re-|- 1) = 1: we cannot 
have G(n) = G(n -|- 1) = G(n -|- 2). Consider now a n such that G(n -|- 1) — 
G(n) = 1. By using the recursive definition of G, we have G(n + 2) — G(n -|- 
1) = (n + 2-G(G(n + l)))-(n + l-G(G(n))) = l-(G(G(n + l))-G(G(n)). 
Hence AG(n + 1) = 1 - (G(G(n) + 1) - G(G(n)) = 1 - AG(G(n)). □ 

This equation provides a way to express G(n -|- 2) in terms of G(n -|- 1) 
and G(n) and G(G(n)) and G(G(n) -|- 1). Since n-|- 1, n, G{n) and G(n) -|- 1 
are all strictly less than n + 2 , we could use this equation as an alternative 
way to define recursively G, alongside two initial equations G(0) = 0 and 
G(l) = 1. We proved in Coq that G is indeed the unique function to satisfy 
these equations (see GD_unique and g_iinpleinents_GD). 

6 The G function 

This section corresponds to file FlipG.v, Here is a quote from page 137 of 
Hofstadter’s book [T]: 

A problem for curious readers is: suppose you flip Diagram G 
around as if in a mirror, and label the nodes of the new tree so 
they increase from left to right. Can you find a recursive algebraic 
definition for this “flip-tree” ? 

6.1 The flip function 

Flipping the G tree as if in a mirror is equivalent to keeping its shape un¬ 
changed, but labeling the nodes from right to left during the breadth-first 
traversal. Let us call flip{n) the new label of node n after this transforma¬ 
tion. We’ve seen in the previous section that given a depth k ^ 0, the nodes 
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at this depth are labeled from 1 + F^+i till -Ffc+ 2 - After the flip transfor¬ 
mation, the 1 -|- -Ffc+i node and the -Ffc +2 node will hence have exchanged 
their label. Similarly, 2 -|- F^+i will become Fk +2 — 1 and vice-versa. More 
generally, if depth{n) = fe, the distance between flip{n) and the leftmost 
node 1 -|- Fk+i will be equal to the distance between n and the rightmost 
node Fk+ 2 , hence: 


flip{n) - (1 Fk+i) = Fk +2 - n 


So: 

flip{n) = 1 + Fk +3 - n 

We finally complete this definition to handle the case n < 1: 

Definition 5. We define the funetion flip : N —)■ N in the following way: 
flip{n) = if {n< 1) then n else 1 F^+^epthin) - n 
A few properties of this flip function: 

Theorem 20. 


1. For all n G N, depth(flip(n)) — depth(n). 

2. For all n, n > 1 if and only if flip{n) > 1. 

3. For 1 < n < Fk, we have flip{Fk+i -|- n) = 1 -|- Fk +2 — n. 
flip is involutive. 

5. For all n > 1, if depth{n+l) = depth{n) then flip{n+l) = flip{n) — l. 

6 . For all n > I, if depth{n—l) = depth{n) then flip{n—l) = flip{n) + l. 

Proof. 

1. If n < 1, then flip{n) = n and the property is obvious. For n > 1, 

if we name k the depth of n, we have flip{n) = 1 -|- Fk+s — n. Since 
1 -I- Fk+i < n < Fk+ 2 , we hence have 1 -|- Fk+s - Fk +2 < flip{n) < 

1 Fk +3 - 1 - Fk+i- And finally 1 -h Fk+i < flip{n) < Fk+ 2 , and 

this characterizes the nodes at depth k, hence depth{flip{n)) = k = 
depth{n). 

2. We know that n and flip{n) have the same depth, and we’ve already 
seen that being less or equal to 1 is equivalent to having depth 0. 

3. Consider 1 < n < F^. Hence k is at least 1, otherwise 1 < Fq = 0. 
We have 1 -|- Fk+i < F^+i + n < Fk+i + Fk = Fk+ 2 , so the depth of 
Fk+i -|- n is A: (still via the same characterization of nodes at depth 
k). Moreover F^+i -|- n is more than 2, so the definition of flip gives: 
flip{Fk+i + n) = 1 + Fk +3 - Fk+i - n = l + Fk +2 - n. 
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4. If n < 1 then flip{n) = n is still less or equal to 1, so flip{flip{n)) = 

flip{n) = n. Consider now n > 1. We already know that flip{n) > 
1 and flip{n) and n have same depth (let us name it k). Hence: 
flip{flip{n)) = 1 + Ffc +2 - flip{n) = 1 + - (1 + F 2 +k - n) = n. 

5. Let us name k the common depth of n + 1 and n. In these conditions 
flip{n + !) = ! + Fk +2 - (n + 1) = (1 + Fk +2 - n) - 1 = flip{n) - 1. 

6. Similar proof. 


□ 

In particular, the third point above shows that for A: > 1 we indeed have 
flip{l + Fk) = Fk+i (and vice-versa since flip is involutive). 

6.2 Definition of G and initial properties 

We can now take advantage of this flip function to obtain a first definition 
of the G function, corresponding to the flipped G tree: 

Definition 6. The function G : N —)• N is defined via: 

Gin) = flipiGiflipin)) 

We benefit from the involutive aspect of flip to switch from right-left to 
left-right diagrams, use G, and then switch back to right-left diagram. The 
corresponding Coq function is named fg. 

By following this definition, the initial values of G are G(0) = 0, G(l) = 
G{ 2 ) = 1, G{3) = 2. The first difference between G and G appears for 
G{7) = 5 = G{7) + 1. We’ll dedicate a whole section later on the comparison 
between G and G. Here is the initial levels of this flipped tree G, the boxed 
values being the places where G and G differ. 
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We now show that G enjoys the same basic properties as G: 

Theorem 21. 

1. For all n > 1, depth{G{n)) = depth{n) — 1. 

2. For all k > 0, G(Ffc_|_i) = F^. 

3. For all k > 1, G{1 + = 1 + 

4- For all n, G{n + 1) — G{n) G {0,1}. 

5. For all n,m, n <m implies 0 < G{m) — G{n) < m — n. 

6 . For all n, 0 < G{n) < n. 

7. For all n, G{n) = 0 i/ and only if n = 0 

8 . For all n > 1, G{n) < n. 

9. For all n / 0, G(n) = G{n — 1) implies G{n + 1) = G{n) + 1. 

10. G is onto. 

Proof. 

1. Since flip doesn’t modify the depth, we reuse the result about the 
depth of G{n). 

2 . G{Fk+i) = flip{G{flip{Fk+i))) = flip{G{l + Fk)) = flip{l + Fk-i) = 
Fk- 
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3. Similar proof. 

4. The property is true for n = 0 and n = 1. Consider now n > 1. Let k 

be depth{n), which is hence different from 0. We have 1 + F^+i < n < 
Tfc_l_ 2 . If n is Ta:+ 2 ) we have already seen that G(l + Tfc_|_ 2 ) = l + = 

1 + G(Tfc_|_ 2 ). Otherwise n < Fk +2 and n + 1 has hence the same depth 
as n. Then flip{n + 1) = flip{n) — 1. Now, by applying G on the two 
consecutive values of same depth flip{n) and flip{n) — 1, we obtain 
two results that are either equal or consecutive, and of same depth. By 
applying flip again on these G results, we end on two consecutive or 
equal G results. 

5. Iteration of the previous results between n and m. 

6. Thanks to the previous point, 0 < G{m) — G{0) < m — 0 and G(0) = 0. 

7. G(l) = 1 and 0 < G{m) — 0(1) as soon as m > 1. 

8. 0(2) = 1 and G{m) — 0(2) < m — 2 as soon as m > 2. 

9. If n = 1, then the precondition 0(1) = 0(0) isn’t satisfied. If n = 2 

or n = 3, the conclusion is satisfied: 0(3) = 2 = 0(2) + 1 and 0(4) = 
3 = 0(3) + 1. We now consider n > 3 such that 0(n) = G{n — 1). 
We have already seen that G{n + 1) — 0(n) is either 0 or 1. If it is 
1, we can conclude. We proceed by contradiction and suppose it is 
0. We hence have G{n — 1) = 0(n) = 0(n + 1), or said otherwise 
flip{G{flip{n — l))) = flip{G{flip{n))) = flip{G{flip{n + l))). Since 
flip is involutive hence bijective, this implies that G{flip{n — 1)) = 
G{flip{n)) = G{flip{n + 1)). To be able to commute flip and the 
successor/predecessor, we now study the depth of these values. 

• If n + 1 has a different depth than n, then n is a Fibonacci number 
Ffc, with /c > 4 since n > 3. Then flip{n + 1) = F^+i and 
flip{n) = 1 + Fk_i. These values aren’t equal nor consecutive, 
hence O cannot give the same result for them, this situation is 
indeed contradictory. 

• Similarly, if n — 1 has a different depth than n, then n — 1 is a 
Fibonacci number Fi^, with k > 3 since n > 3. Then flip{n — l) = 
1 + Fk-i while flip{n) = F^+i- Once again, these values aren’t 
equal nor consecutive, hence G cannot give the same result for 
them. Contradiction. 

• In the last remaining case, depth{n — 1) = depth{n) = depth{n + 
1). Then flip{n + 1) = flip{n) — 1 and flip{n — 1) = flip{n) + 1 
and these value are at distance 2, while having the same result by 
G, which is contradictory. 
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10. We reason as we did earlier for G: G starts with G(0) = 0, it grows by 
steps of 0 or 1, and it grows by at least 1 every two steps. Hence its 
limit is +00 and it is an onto function. 

□ 

6.3 An recursive algebraic definition for G 

The following result is an answer to Hofstadter’s problem. It was already 
mentioned on OEIS page [5], but only as a conjecture. 

Theorem 22. For all n > 3 we have G{n) = n + 1 — G{1 + G{n — 1)). And 
G is uniquely characterized by this equation plus initial equations G'(O) = 0, 
G(l) = G{2) = 1 andG{3) = 2. 

Proof. We consider n > 3. Let k be its depth, which is hence at least 3. We 
know that 1 + Tfc+i < n < Tfc_|_ 2 . We will note flip{n) below as n. Roughly 
speaking, this proof is essentially a use of theorem [11] which implies that 
G{G{n + 1) — 1) = n — G{n), and then some play with flip and predecessors 
and successors, when possible, or direct particular proofs otherwise. 

• We start with the most general case : we suppose here depth{n — 1) = 
depth(n) and depth{G{n — 1) + 1) = depth{G{n — 1). Let us shorten 
G(n—1) + 1 asp. In particular depth{p) = depth{G{n—l)) = depth{n— 
1) — 1 = depth{n) — 1 = k — 1. Due to the equalities between depths, 
and the facts that n > 1 and G{n — 1) > 1, we’re allowed to use the 
last properties of theorem [20| : flip{n — 1) = n + 1 and 

flip{p) = flip{G{n - 1) + 1) 

= flip{G{n - 1)) - 1 
= flip{flip{G{flip{n - 1)))) - 1 
= G{flip{n - 1)) - 1 
= G(n + 1) - 1 

We now exploit the definition of flip three times: 

1. depth{n) = A: / 0 implies re = 1 + — re 

2. depth{G{flip{p))) = depth{p) — l = k — hence: 

G{p) = flip{G{flip{p))) = 1 + Fk+i - Giflipip)) 

3. depth{G{n)) = depth{n) — 1 = depth{n) — 1 = A: — 1 0, so: 

Gin) = flip{G{n)) = 1 + Fk +2 - Gin) 
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All in all: 

n + 1 — G{1 + G{n — 1)) = n + 1 — G{p) 

= n + 1 - (1 + Ffc+i - G{flip{p))) 

= n — Ffc_|_i + G{G{n + 1) — 1)) 

= n - Ffc+i + (n - G{n)) 

= n- Ffc+i + (1 + Ffc +3 - n) - G(n) 

= 1 + -^fc+2 ~ G'(^) 

= G(n) 

• We now consider the case where depth{n — 1) ^ depth{n). So n is the 
least number of depth k, hence n = 1 + F^+i- In this case: 

1 + n — G{1 + G{n — 1)) = 2 + F^+i — G{1 + G{Fk+i)) 

= 2 + Ffc_|_i — G(1 + Fk) 

= 2 + Ffc_|_i — (1 + Ffc_i) 

= 1 + 

= G{n) 

• The last case to consider is depth{n — 1) = depth{n) but depth{G{n — 
1) +1) / depth{G{n — 1)). As earlier, we prove that depth{G{n — 1)) = 
depth{n — 1) — 1 = depth{n) — 1 = k — 1. So G{n — 1) is the greatest 
number of depth k — 1, hence G{n — 1) = Fk+i- Since G{Fk+ 2 ) is also 

then n — 1 and Ffc _|_2 are at a distance of 0 or 1. But we know 
that depth{n) = k, hence n < Fk+ 2 , so n — 1 < Fk+ 2 - Finally n = Fk +2 
and: 


1 + n — G{1 + G{n — 1)) = 1 + Ffc _|_2 — G{1 + Ffc+i) 

= 1 + -Ffc+2 “ (1 + -Ffc) 

= Ffc+i 

= G{n) 

Finally, if we consider another function F satisfying the same recursive 
equation, as well as the same initial values for n < 3, then we prove by 
strong induction over n that Vn, F(n) = G{n). This is clear for n < 3. 
Consider now some n > 3, and assume that F{k) = G{k) for all k < n. 
In particular F{n — 1) = G{n — 1) by induction hypothesis for n — 1 < n. 
Moreover n — 1 > 1 hence G{n — 1) < n — 1 hence G{n — 1) + 1 < n. So we 
could use a second induction hypothesis at this position: F{G{n — 1) + 1) = 
G{G{n — 1) + 1). This combined with the first induction hypothesis above 
gives F{F{n — 1) + 1) = G{G{n — 1) + 1) and finally F{n) = G{n) thanks 
to the recursive equations for F and G. □ 
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6.4 The G tree 


Since G has been obtained as the mirror of G, we already know its shape: 
it’s the mirror of the shape of G. We’ll nonetheless be slightly more precise 
here. The proofs given in this section will be deliberately sketchy, please 
consult theorem unary_flip and alii in file FlipG.v for more detailed and 
rigorous justifications. 

First, a node n has a child p in the tree G is and only if the node flip(n) 
has a child flip{p) in the tree G. Indeed, G{p) = n iff flip{G{flip{p))) = n 
iff G{flip{p)) = flip{n). Moreover, a rightmost child in G corresponds by 
flip with a leftmost child in G and vice-versa. Indeed, if p and n aren’t on 
the borders of the trees, then the next node will become the previous by flip, 
and vice-versa (see theorem [ 20 ]). And if p and/or n are on the border, they 
are Fibonacci numbers or successors of Fibonacci numbers, and we check 
these cases directly. 

Similarly, the arity of n in G is equal to the arity of flip{n) in G. For 
instance, if we take a unary node n and its unique child p in G, this means 
that G{p -|- 1) = n -|- 1 and G{p — 1) = n — 1. In the most general case flip 
will lead to similar properties about flip{p) and flip{n) in G, hence the fact 
that flip(n) is unary in G. And we handle particular cases about Fibonacci 
numbers on the border as usual. 

Theorem 23. For all n > 1, flip{flip{n) + G{flip{n))), which could also 
be written flip{flip{n) + flip{G{n))), is the leftmost child of n in the G tree. 

Proof. See the previous paragraph: flip(n) + G{flip{n)) is rightmost child 
of flip{n) in G, hence the result about G. □ 

Theorem 24. For all n > 1, n — 1 -|- G{n + 1) is the rightmost child of n in 
the G tree. 

Proof. We already know that all nodes n in the G tree have at least one 
antecedent, and no more than two. Take n > 1, and let k be the largest 
of its antecedents by G. Hence G{k) = n and G{k -|- 1) 7 ^ n, leading to 
G{k -|- 1) = n -|- 1. If we re-inject this into the previous recursive equation 
G{k -|- 1) = A: -|- 2 — G{G{k) + 1), we obtain that n + 1 = k + 2 — G{n -|- 1) 
hence k = n — 1 + G{n + 1). □ 

Of course, for unary nodes, there is only one child, hence the leftmost 
and rightmost children given above coincide. Otherwise, for binary nodes, 
they are apart by 1 . 

Theorem 25. In the G tree, a binary node has a right child which is also 
binary, and a left child which is unary, while the unique child of a unary 
node is itself binary. 

Proof. Flipped version of theorem (TH] □ 
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6.5 Comparison between G and G 

We already noticed earlier that G and G produce very similar answers. Let 
us study this property closely now. 

Theorem 26. For all n, we have G{n) = 1 + G{n) whenever n is 3-odd, 
and G{n) = G{n) otherwise. 

Proof. We proceed by strong induction over n. When n < 3, n is never 3-odd, 
and we indeed have G{n) = G{n). We now consider n > 3, and assume that 
the result is true at all positions strictly less than n. For comparing G{n) 
and G{n) we use their corresponding recursive equations: when n is 3-odd 
we need to prove that G{G{n — 1) -|- 1) = G{G{n — 1)), and otherwise we 
need to establish that G{G{n — 1) -|- 1) = 1 -|- G{G{n — 1)). So we’ll need 
induction hypotheses (IH) for n — 1 < re and for G{n — 1) -|-1 (which is indeed 
strictly less than re: since 1 < re — 1, we have G{n — 1) < re — 1). But the 
exact equations given by these two IH will depend on the status of re — 1 and 
G{n — 1) -|- 1 : are these numbers 3-odd or not ? For determining that, we’ll 
consider the Fibonacci decomposition of re and study its classification. 

• Case low{n) = 2. Hence re isn’t 3-odd and we try to prove G{G{n — 
1) -|- 1) = 1 -|- G{G{n — 1)). Theorem [5] implies that low{n — 1) > 3, 
and in particular re — 1 isn’t 3-odd. So the first IH is G{n — 1) = 
G(re — 1). By theorem 1151 we also know that G(re — 1) = G{n) — 1. So 
G{n — 1) -|- 1 = G{n — 1) -|- 1 = G{n) By theorem [l3 we know that 
G{n) has an even lowest rank, so it cannot be 3-odd, and the second 
IH is G{G{n — 1) -|- 1) = G{G{n — 1) -|- 1). Since low{G{n)) is even, we 
can use theorem 1151 once more: G{G{n) — 1) = G{G{n)) — 1. Finally: 
G(G{n - 1) + 1) = G(G{n - 1) + 1) = G{G{n)) = 1 + G{G{n - 1)). 

• Case re 3-odd. We’re trying to prove G{G{n — 1) -|- 1) = G{G{n — 1)). 
Theorem [5] implies that low{n — 1) = 2, and in particular re — 1 isn’t 
3-odd. So the first IH is G{n — 1) = G{n — 1). By theorem [151 
we also know that G(re — 1) = G{n). Moreover theorem 1181 shows 
that G{n — 1) -|- 1 = G'(re) -|- 1 cannot be 3-odd. So the second IH 
gives G{G{n — 1) -|- 1) = G{G{n — 1) -I- 1). Now, low{G{n)) = 2 by 
theorem EZl so G{G{n) -|- 1) = G{G{n) by theorem 1151 or equivalently 
G{G{n — 1) -|- 1) = G{G{n — 1)), hence the desired equation. 

• Case re 3-even. As in all the remaining cases, we’re now trying here to 
prove G{G{n — 1) -|- 1) = 1 -|- G{G{n — 1)). This case is very similar 
to the previous one until the point where we study G'(re) -|- 1 which is 
now 3-odd (still thanks to theorem llSp . So the second IH gives now 
G{G{n— 1) -I-1) = 1 -l-G(G(re— 1) -|-1). And we conclude just as before 
by ensuring for the same reasons that G{G{n — 1) -|- 1) = G{G{n — 1)). 
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• Case low{n) = 4. Since 4 is even, we also have G{n — 1) = G{n) — 1. 
Hence low{G{n)) = 3 by theorem [T 6 l and G{G{n) — 1) = G{G{n)) and 
G{G{n) + 1) = l + G{G{n)), both bv theorem fT5l Finally l + G{G{n — 
1)) = 1 + G{G{n)) = G{G{n) + 1). Now, to determine whether n — 1 
is 3-odd, we need to look deeper in the canonical decomposition of 
n = F 4 -|- Ffc -|- YjFi. 

— If the second lowest rank k is odd, then n — 1 = F'i -\- + T,Fi 

is hence 3-odd, and the first IH is G{n — 1) = 1 -|- G{n — 1). So 
G{n — 1) + 1 = G{n) -)- 1, and this number has an even lowest 
rank (theorem [T71) . it cannot be 3-odd, and the second IH is: 
G{G{n — 1) -|- 1) = G{G{n — 1) -|- 1). And this is known to be 
equal to G{G{n) -|- 1) = 1 -|- G{G{n — 1)). 

— Otherwise k is even and n —1 is 3-even and hence not 3-odd, so the 
first IH is G(n—1) = G(n—1). Moreover, since n—1 is 3-even then 
G{n—1) + 1 = G{n—1) + 1 is 3-odd by theorem [TBJ So the second 
IH is: ^(^(n —1)-|-1) = l-|-G'(G(n—1)-|-1). And this is known to 
be equal to l-|-G(G(n — 1)-|-1) = l-|-G'(G(n)) = l-|-G'(G(n— 1)). 

• Case low{n) > 4 and odd. By theorem [SI low{n — 1) = 2 hence the 
first IH is G{n — 1) = G{n — 1). We also know that G{n — 1) = G{n) 
by theorem [TSl and that low{G{n — 1 ) -f 1 ) = low{G{n) + 1 ) is even by 
theorem [T71 The second IH is hence: G(G(n—1)-|-1) = G(G(n—1)-|-1). 
Finally G{G{n) -f 1) = 1 -f G{G{n)) since low{G{n)) = 2 / 1 . 

• Case low{n) > 4 and even. In this case, n — 1 is 3-odd (theorem 
ED, so the first IH is G{n — 1) = 1 -|- G{n — 1). We also know that 
G{n — 1 ) = G{n) — 1 by theorem [TSJ By theorem [16] we know that 
low{G{n)) = low{n) — 1 > 3, so by theorem [3]/orc(G(n) -f 1 ) = 2 , and 
G{n — 1) -|- 1 = 1 -|- G{n) cannot be 3-odd : the second IH is hence 
G{G{n—l) + l) = G{G{n — l) + l). Moreover Zorc(G(n)) is also known 
to be odd, so by theorem 1151 we have G{G{n — 1)) = G{G{n) — 1 ) = 
G{G{n)). The final step is G{l + G{n)) = l-|-G(G(n)) also by theorem 
[TSl (since low{G{n)) / 2 ). 

□ 

As an immediate consequence, G is always greater or equal than G, but 
never more than G + 1. And we’ve already studied the distance between 
3-odd numbers, which is always 5 or 8 , while the hrst 3-odd number is 7. So 
G and G are actually equal more than 80% of the time. 

6.6 G and its “derivative” AG 

We consider now the “derivative” AG of G, defined via AG(n) = G(n-|- 1) — 
G(n). This study will be quite similar to the corresponding section [53] for 
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G. We already know from theorem 12 II that the output of AG is always either 
0 or 1 . 

Theorem 27. For all n > 2, AG(n + 1) = 1 — AG(n).AG(G(n + 1)). 

Proof. We already know that AG(n) = 0 implies AG(n+ 1) = 1: we cannot 
have G(n) = G(n + 1) = G(n + 2). Consider now some n > 2 such that 
G(n + 1) — G(n) = 1. By using the recursive equation of G for n + 1 > 3 
and n + 2 > 3, we have as expected 

AG(n + 1) = G(n + 2 ) - G(n + 1) 

= (n + 3 — G{G{n + 1) + 1)) — (n + 2 — G(G{ji) + 1)) 

= 1 — {G{G{n + 1 ) + 1 ) — G{(G{n) + 1) 

= 1 — (G{G{n + 1) + 1) — G{(G{n + 1)) 

= 1 - AG(G(n + 1)) 


□ 

Note: when compared with theorem [19] about AG, the equation above 
looks really similar, but has a different inner call (AG(G(n + l)) instead of 
AG(G(n))), and doesn’t hold for n = 2. 

For n > 2, this equation provides a way to express G(n + 2) in terms of 
G(n + 1) and G(n) and G(G(n + 1)) and G(G(n + 1) + 1). For n > 1 , we 
know that G{n + 1) < n + 1 hence G{n + 1) + 1 < n + 2. It is also clear that 
n + 1 and n and G(n + 1) are all strictly less than n + 2 . So we could use 
this equation as an alternative way to define recursively G, alongside initial 
equations G(0) = 0 and G(l) = G(2) = 1 and G(3) = 2 and G(4) = 3. We 
proved in Coq that G is indeed the unique function to satisfy these equations 
(see FD_unique and f g_implements_FD). 

6.7 An alternative recursive equation for G 

During the search for an algebraic definition of G, we first discovered and 
proved correct the following result, which unfortunately doesn’t uniquely 
characterize G. We mention it here nonetheless, for completeness sake. 

Theorem 28. For all n > 3 we have G(n — 1) + G(G(n)) = n. 

Proof. We proceed as for theorem 1221 except that we use internally the orig¬ 
inal recursive equation for G instead of the alternative equation of theorem 
EH We take n > 3, call k its depth (which is greater or equal to 3), and note 
n = flip{n). First: 

G{G{n)) = flip{G{flip{flip{G{n))))) = flip{G{G{n))) 


27 


The depth of G{G{n)) is /c — 2 7 ^ 0, so the definition of flip gives: 

G{G{n)) = flip{G{G{n))) = 1 + F^+i - G{G{n)) 

Now, thanks to the recursive definition of G at n + 1, we have 
G{G{n)) = 1 + -Ffc+i + G{n + 1) — n. — 1 

• If depth{n — 1) = k then flip{n — 1) = n + 1 and we conclude by more 
uses of the flip definition: depth{G{n + 1)) = depth{G{flip{n — 1)) = 
depth{n — 1 ) — 1 = A: — 1 , so: 

Gin-1) = flipiGiflipin-1))) = //zp(G(n+l)) = l+Ffc+ 2 -G'(n + l) 
And finally: 

G(G(n)) = Ffc+i + (l + Ffc _|_2 —G(n —1)) —(1 + Ffc _|_3 —n) = n —G(n —1) 

• If depthin — 1) k then n is the least number at depth k, so n = 
1 + Ffc+i, and: 

G(n-l)+G(G(n)) = G(Ffc+i) + G(G(l + Ffc+i)) = Ffc + (l+Ffc_i) =n 

□ 

Note that the following function / : N —)■ N also satisfies this equation, 
even with the same initial values than G: 

/(O) = 0 

/( 1 ) = /( 2 ) = 1 

/(3) = 2 

/(4) = /(5) = 3 

/( 6 ) = 5 

/(7) = 3 

Vn < 4, /(2n) = n — 2 
Vn < 4, /(2n + 1 ) = 4 

This function isn’t monotone. We actually proved in Coq that G is the 
only monotone function that satisfies the previous equation and the ini¬ 
tial constraints Oi-t- 0, li—)-l, 2i—)-l, 3i— 7-2 (see alt_inono_unique and 
alt_inono_is_fg). We will not detail these proofs here, the key ingredient 
is to prove that any monotone function satisfying these equations will grow 
by at most 1 at a time. 
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7 Conclusion 


The proofs for theorems [22] and [26] are surprisingly tricky, all our attempts 
at simplifying them have been rather unsuccessful. But there’s probably still 
room for improvements here, please let us know if you find or encounter nicer 
proofs. 

Perhaps using the dehnition of G via real numbers could help shortening 
some proofs. We proved in file Phi. v this definition Vn, G{n) = where 

If is the golden ratio (1 + y/b)/2. But this has been done quite late in our 
development, and we haven’t tried to use this fact for earlier proofs. Anyway, 
relating this definition with the flipped function G doesn’t seem obvious. 

Another approach might be to relate G more directly to some kind of 
Fibonacci decomposition. Of course, now that theorem [26] is proved, we 
know that G shifts the ranks of the Fibonacci decompositions just as G, 
except for 3-odd numbers where a small -|-1 correction is needed. But could 
this fact be established more directly ? For the moment, we are only aware 
of the following “easy” formulation of G via decompositions: if n is written 
as Ffc _|_2 — TiFi where k = depth{n) and STj form a canonical decomposition 
of Ffc _|_2 — n, then G{n) — Fk+i — TiFi-i + e, where e = 1 whenever the 
decomposition above includes F 2 , and e = 0 otherwise. But this statement 
didn’t brought any new insights for the proof of theorem [26l so we haven’t 
formulated it in Coq. 

As possible extensions of this work, we might consider later the other 
recursive functions proposed by Hofstadter : 

• H defined via F[{n) = n — — 1))), or its generalized version 

with an arbitrary number of sub-calls instead of 2 for G and 3 for H. 

• H, the flipped version of H. 

• M and T, the mutually recursive functions (“Male” and “Female”). 

We’ve already done on paper a large part of the analysis of M and F, and 
should simply take the time to certify it in Coq. The study of H and H 
remains to be done, it looks like a direct generalization of what we’ve done 
here, but surprises are always possible. 
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